Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 26458 |
| Missing cells | 192211 |
| Missing cells (%) | 40.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.6 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 8 |
2013 is highly correlated with 2014 and 6 other fields | High correlation |
2014 is highly correlated with 2013 and 6 other fields | High correlation |
2015 is highly correlated with 2013 and 6 other fields | High correlation |
2016 is highly correlated with 2013 and 6 other fields | High correlation |
2017 is highly correlated with 2013 and 6 other fields | High correlation |
2018 is highly correlated with 2013 and 7 other fields | High correlation |
2019 is highly correlated with 2013 and 7 other fields | High correlation |
2020 is highly correlated with 2013 and 7 other fields | High correlation |
LABEL2020 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2017 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2016 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2018 is highly correlated with 2018 and 9 other fields | High correlation |
LABEL2019 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2014 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2013 is highly correlated with LABEL2014 and 6 other fields | High correlation |
LABEL2015 is highly correlated with LABEL2013 and 6 other fields | High correlation |
LABEL2013 has 24111 (91.1%) missing values | Missing |
LABEL2014 has 24081 (91.0%) missing values | Missing |
LABEL2015 has 24080 (91.0%) missing values | Missing |
LABEL2016 has 24080 (91.0%) missing values | Missing |
LABEL2017 has 24051 (90.9%) missing values | Missing |
LABEL2018 has 23977 (90.6%) missing values | Missing |
LABEL2019 has 23938 (90.5%) missing values | Missing |
LABEL2020 has 23893 (90.3%) missing values | Missing |
2013 has 21882 (82.7%) zeros | Zeros |
2014 has 19464 (73.6%) zeros | Zeros |
2015 has 16130 (61.0%) zeros | Zeros |
2016 has 14553 (55.0%) zeros | Zeros |
2017 has 13492 (51.0%) zeros | Zeros |
2018 has 13328 (50.4%) zeros | Zeros |
2019 has 12786 (48.3%) zeros | Zeros |
2020 has 12053 (45.6%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-22 15:21:32.640383 |
|---|---|
| Analysis finished | 2022-09-22 15:21:48.853745 |
| Duration | 16.21 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
LAT
Real number (ℝ≥0)
| Distinct | 192 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.40020996 |
| Minimum | 16.9375 |
|---|---|
| Maximum | 17.8925 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 16.9375 |
|---|---|
| 5-th percentile | 17.0325 |
| Q1 | 17.2175 |
| median | 17.3975 |
| Q3 | 17.5775 |
| 95-th percentile | 17.7825 |
| Maximum | 17.8925 |
| Range | 0.955 |
| Interquartile range (IQR) | 0.36 |
Descriptive statistics
| Standard deviation | 0.2316771372 |
|---|---|
| Coefficient of variation (CV) | 0.01331461734 |
| Kurtosis | -0.9851021179 |
| Mean | 17.40020996 |
| Median Absolute Deviation (MAD) | 0.18 |
| Skewness | 0.05136507924 |
| Sum | 460374.755 |
| Variance | 0.05367429588 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.4775 | 195 | 0.7% |
| 17.4975 | 195 | 0.7% |
| 17.5075 | 194 | 0.7% |
| 17.4925 | 194 | 0.7% |
| 17.5025 | 194 | 0.7% |
| 17.3125 | 193 | 0.7% |
| 17.4875 | 193 | 0.7% |
| 17.3075 | 193 | 0.7% |
| 17.3175 | 192 | 0.7% |
| 17.4725 | 192 | 0.7% |
| Other values (182) | 24523 |
| Value | Count | Frequency (%) |
| 16.9375 | 3 | < 0.1% |
| 16.9425 | 6 | < 0.1% |
| 16.9475 | 13 | < 0.1% |
| 16.9525 | 17 | 0.1% |
| 16.9575 | 22 | 0.1% |
| 16.9625 | 29 | |
| 16.9675 | 41 | |
| 16.9725 | 54 | |
| 16.9775 | 56 | |
| 16.9825 | 67 |
| Value | Count | Frequency (%) |
| 17.8925 | 3 | < 0.1% |
| 17.8875 | 9 | < 0.1% |
| 17.8825 | 10 | < 0.1% |
| 17.8775 | 11 | < 0.1% |
| 17.8725 | 18 | |
| 17.8675 | 28 | |
| 17.8625 | 31 | |
| 17.8575 | 36 | |
| 17.8525 | 41 | |
| 17.8475 | 42 |
LON
Real number (ℝ≥0)
| Distinct | 208 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 78.47750121 |
| Minimum | 78.0075 |
|---|---|
| Maximum | 79.0425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 78.0075 |
|---|---|
| 5-th percentile | 78.1125 |
| Q1 | 78.2775 |
| median | 78.4775 |
| Q3 | 78.6675 |
| 95-th percentile | 78.8775 |
| Maximum | 79.0425 |
| Range | 1.035 |
| Interquartile range (IQR) | 0.39 |
Descriptive statistics
| Standard deviation | 0.2377368942 |
|---|---|
| Coefficient of variation (CV) | 0.003029363709 |
| Kurtosis | -0.9561032173 |
| Mean | 78.47750121 |
| Median Absolute Deviation (MAD) | 0.195 |
| Skewness | 0.1112356494 |
| Sum | 2076357.727 |
| Variance | 0.05651883084 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 78.5225 | 180 | 0.7% |
| 78.5475 | 179 | 0.7% |
| 78.5175 | 179 | 0.7% |
| 78.5125 | 179 | 0.7% |
| 78.4775 | 179 | 0.7% |
| 78.48251 | 178 | 0.7% |
| 78.5425 | 178 | 0.7% |
| 78.4625 | 178 | 0.7% |
| 78.4675 | 177 | 0.7% |
| 78.4725 | 177 | 0.7% |
| Other values (198) | 24674 |
| Value | Count | Frequency (%) |
| 78.0075 | 3 | < 0.1% |
| 78.0125 | 4 | < 0.1% |
| 78.0175 | 6 | < 0.1% |
| 78.0225 | 9 | < 0.1% |
| 78.0275 | 12 | < 0.1% |
| 78.0325 | 15 | 0.1% |
| 78.03751 | 19 | |
| 78.0425 | 21 | |
| 78.0475 | 27 | |
| 78.05251 | 45 |
| Value | Count | Frequency (%) |
| 79.0425 | 1 | < 0.1% |
| 79.03751 | 3 | < 0.1% |
| 79.0325 | 5 | < 0.1% |
| 79.0275 | 7 | |
| 79.0225 | 11 | |
| 79.0175 | 11 | |
| 79.0125 | 11 | |
| 79.0075 | 12 | |
| 79.0025 | 12 | |
| 78.99751 | 14 |
| Distinct | 4575 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 181.2147558 |
| Minimum | 0 |
|---|---|
| Maximum | 9756.0752 |
| Zeros | 21882 |
| Zeros (%) | 82.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 925.1541705 |
| Maximum | 9756.0752 |
| Range | 9756.0752 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 665.6123044 |
|---|---|
| Coefficient of variation (CV) | 3.673057976 |
| Kurtosis | 49.08335645 |
| Mean | 181.2147558 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.234235304 |
| Sum | 4794580.01 |
| Variance | 443039.7397 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 21882 | |
| 558.73682 | 2 | < 0.1% |
| 557.90344 | 2 | < 0.1% |
| 1616.15698 | 1 | < 0.1% |
| 13.75133 | 1 | < 0.1% |
| 318.39633 | 1 | < 0.1% |
| 1347.98767 | 1 | < 0.1% |
| 1841.99182 | 1 | < 0.1% |
| 1038.43005 | 1 | < 0.1% |
| 1039.05225 | 1 | < 0.1% |
| Other values (4565) | 4565 | 17.3% |
| Value | Count | Frequency (%) |
| 0 | 21882 | |
| 0.03219 | 1 | < 0.1% |
| 0.06535 | 1 | < 0.1% |
| 0.1866 | 1 | < 0.1% |
| 0.46692 | 1 | < 0.1% |
| 0.49837 | 1 | < 0.1% |
| 0.51894 | 1 | < 0.1% |
| 0.9422 | 1 | < 0.1% |
| 1.14573 | 1 | < 0.1% |
| 1.37343 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9756.0752 | 1 | |
| 9685.37305 | 1 | |
| 9536.60547 | 1 | |
| 9107.61914 | 1 | |
| 9064.5752 | 1 | |
| 8957.39453 | 1 | |
| 8872.7373 | 1 | |
| 8367.24512 | 1 | |
| 8280.75488 | 1 | |
| 8267.02539 | 1 |
| Distinct | 6994 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 370.6823756 |
| Minimum | 0 |
|---|---|
| Maximum | 11829.86328 |
| Zeros | 19464 |
| Zeros (%) | 73.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 89.7007725 |
| 95-th percentile | 2184.591112 |
| Maximum | 11829.86328 |
| Range | 11829.86328 |
| Interquartile range (IQR) | 89.7007725 |
Descriptive statistics
| Standard deviation | 1106.442522 |
|---|---|
| Coefficient of variation (CV) | 2.984880304 |
| Kurtosis | 24.98761844 |
| Mean | 370.6823756 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.658906228 |
| Sum | 9807514.294 |
| Variance | 1224215.054 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19464 | |
| 558.73682 | 2 | < 0.1% |
| 10183.80957 | 1 | < 0.1% |
| 8808.31055 | 1 | < 0.1% |
| 7647.25195 | 1 | < 0.1% |
| 4795.02783 | 1 | < 0.1% |
| 6213.41943 | 1 | < 0.1% |
| 3387.76367 | 1 | < 0.1% |
| 3498.34351 | 1 | < 0.1% |
| 1988.77515 | 1 | < 0.1% |
| Other values (6984) | 6984 | 26.4% |
| Value | Count | Frequency (%) |
| 0 | 19464 | |
| 0.03219 | 1 | < 0.1% |
| 0.06535 | 1 | < 0.1% |
| 0.46692 | 1 | < 0.1% |
| 0.49837 | 1 | < 0.1% |
| 1.14573 | 1 | < 0.1% |
| 1.37415 | 1 | < 0.1% |
| 1.54392 | 1 | < 0.1% |
| 1.58742 | 1 | < 0.1% |
| 1.66835 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11829.86328 | 1 | |
| 10705.52832 | 1 | |
| 10586.00879 | 1 | |
| 10238.8291 | 1 | |
| 10185.72461 | 1 | |
| 10183.80957 | 1 | |
| 10181.1748 | 1 | |
| 10115.18652 | 1 | |
| 9963.30957 | 1 | |
| 9932.44824 | 1 |
| Distinct | 10327 |
|---|---|
| Distinct (%) | 39.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 611.610876 |
| Minimum | 0 |
|---|---|
| Maximum | 13333.70801 |
| Zeros | 16130 |
| Zeros (%) | 61.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 555.6220725 |
| 95-th percentile | 3829.096146 |
| Maximum | 13333.70801 |
| Range | 13333.70801 |
| Interquartile range (IQR) | 555.6220725 |
Descriptive statistics
| Standard deviation | 1454.79882 |
|---|---|
| Coefficient of variation (CV) | 2.378634647 |
| Kurtosis | 13.49591277 |
| Mean | 611.610876 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.519816701 |
| Sum | 16182000.56 |
| Variance | 2116439.607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16130 | |
| 545.43134 | 2 | < 0.1% |
| 553.50891 | 2 | < 0.1% |
| 6705.5249 | 1 | < 0.1% |
| 1626.40051 | 1 | < 0.1% |
| 89.93493 | 1 | < 0.1% |
| 732.25848 | 1 | < 0.1% |
| 571.89868 | 1 | < 0.1% |
| 23.73878 | 1 | < 0.1% |
| 25.30394 | 1 | < 0.1% |
| Other values (10317) | 10317 |
| Value | Count | Frequency (%) |
| 0 | 16130 | |
| 0.25951 | 1 | < 0.1% |
| 0.28713 | 1 | < 0.1% |
| 0.44198 | 1 | < 0.1% |
| 0.46692 | 1 | < 0.1% |
| 0.48031 | 1 | < 0.1% |
| 0.49837 | 1 | < 0.1% |
| 1.14573 | 1 | < 0.1% |
| 1.37415 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13333.70801 | 1 | |
| 13245.87305 | 1 | |
| 12632.6709 | 1 | |
| 11832.88281 | 1 | |
| 11765.66797 | 1 | |
| 11589.7666 | 1 | |
| 11110.6377 | 1 | |
| 10918.5166 | 1 | |
| 10761.90332 | 1 | |
| 10725.00879 | 1 |
| Distinct | 11905 |
|---|---|
| Distinct (%) | 45.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 908.6649698 |
| Minimum | 0 |
|---|---|
| Maximum | 15308.28418 |
| Zeros | 14553 |
| Zeros (%) | 55.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 683.74295 |
| 95-th percentile | 5635.017672 |
| Maximum | 15308.28418 |
| Range | 15308.28418 |
| Interquartile range (IQR) | 683.74295 |
Descriptive statistics
| Standard deviation | 1899.490959 |
|---|---|
| Coefficient of variation (CV) | 2.090419486 |
| Kurtosis | 8.326804123 |
| Mean | 908.6649698 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.853740586 |
| Sum | 24041457.77 |
| Variance | 3608065.905 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 14553 | |
| 545.43134 | 2 | < 0.1% |
| 1581.91125 | 1 | < 0.1% |
| 1801.02246 | 1 | < 0.1% |
| 5240.9751 | 1 | < 0.1% |
| 2565.99463 | 1 | < 0.1% |
| 1725.89136 | 1 | < 0.1% |
| 1286.41138 | 1 | < 0.1% |
| 1172.07043 | 1 | < 0.1% |
| 860.46014 | 1 | < 0.1% |
| Other values (11895) | 11895 |
| Value | Count | Frequency (%) |
| 0 | 14553 | |
| 0.25951 | 1 | < 0.1% |
| 0.44198 | 1 | < 0.1% |
| 0.46692 | 1 | < 0.1% |
| 0.48031 | 1 | < 0.1% |
| 1.14573 | 1 | < 0.1% |
| 1.37415 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| 1.54492 | 1 | < 0.1% |
| 1.66835 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15308.28418 | 1 | |
| 15274.26367 | 1 | |
| 13830.87695 | 1 | |
| 13477.22852 | 1 | |
| 13212.47949 | 1 | |
| 13006.55371 | 1 | |
| 12857.41309 | 1 | |
| 12790.9668 | 1 | |
| 12771.90625 | 1 | |
| 12601.27637 | 1 |
| Distinct | 12967 |
|---|---|
| Distinct (%) | 49.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1040.993616 |
| Minimum | 0 |
|---|---|
| Maximum | 15504.71875 |
| Zeros | 13492 |
| Zeros (%) | 51.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 908.89009 |
| 95-th percentile | 6082.00422 |
| Maximum | 15504.71875 |
| Range | 15504.71875 |
| Interquartile range (IQR) | 908.89009 |
Descriptive statistics
| Standard deviation | 2016.483117 |
|---|---|
| Coefficient of variation (CV) | 1.937075392 |
| Kurtosis | 6.674668357 |
| Mean | 1040.993616 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.589383927 |
| Sum | 27542609.09 |
| Variance | 4066204.16 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 13492 | |
| 1185.23572 | 1 | < 0.1% |
| 1143.42493 | 1 | < 0.1% |
| 404.53268 | 1 | < 0.1% |
| 58.28705 | 1 | < 0.1% |
| 4.50438 | 1 | < 0.1% |
| 658.14856 | 1 | < 0.1% |
| 402.2626 | 1 | < 0.1% |
| 148.54636 | 1 | < 0.1% |
| 1129.0575 | 1 | < 0.1% |
| Other values (12957) | 12957 |
| Value | Count | Frequency (%) |
| 0 | 13492 | |
| 0.08405 | 1 | < 0.1% |
| 0.25951 | 1 | < 0.1% |
| 0.44198 | 1 | < 0.1% |
| 0.58611 | 1 | < 0.1% |
| 1.12765 | 1 | < 0.1% |
| 1.35177 | 1 | < 0.1% |
| 1.37415 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| 1.54492 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15504.71875 | 1 | |
| 15395.75586 | 1 | |
| 13878.33203 | 1 | |
| 13849.60156 | 1 | |
| 13776.4248 | 1 | |
| 13468.87793 | 1 | |
| 13252.1123 | 1 | |
| 13139.76367 | 1 | |
| 12893.33398 | 1 | |
| 12794.59668 | 1 |
| Distinct | 13131 |
|---|---|
| Distinct (%) | 49.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1058.922911 |
| Minimum | 0 |
|---|---|
| Maximum | 15504.71875 |
| Zeros | 13328 |
| Zeros (%) | 50.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 935.9483525 |
| 95-th percentile | 6186.540235 |
| Maximum | 15504.71875 |
| Range | 15504.71875 |
| Interquartile range (IQR) | 935.9483525 |
Descriptive statistics
| Standard deviation | 2034.872139 |
|---|---|
| Coefficient of variation (CV) | 1.921643321 |
| Kurtosis | 6.528489641 |
| Mean | 1058.922911 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.565468639 |
| Sum | 28016982.37 |
| Variance | 4140704.622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 13328 | |
| 2937.95996 | 1 | < 0.1% |
| 404.53268 | 1 | < 0.1% |
| 58.28705 | 1 | < 0.1% |
| 4.50438 | 1 | < 0.1% |
| 658.14856 | 1 | < 0.1% |
| 402.2626 | 1 | < 0.1% |
| 148.54636 | 1 | < 0.1% |
| 1129.0575 | 1 | < 0.1% |
| 738.9458 | 1 | < 0.1% |
| Other values (13121) | 13121 |
| Value | Count | Frequency (%) |
| 0 | 13328 | |
| 0.08405 | 1 | < 0.1% |
| 0.25951 | 1 | < 0.1% |
| 0.58611 | 1 | < 0.1% |
| 1.12765 | 1 | < 0.1% |
| 1.35177 | 1 | < 0.1% |
| 1.37415 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| 1.54492 | 1 | < 0.1% |
| 1.66835 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15504.71875 | 1 | |
| 15395.75586 | 1 | |
| 13878.33203 | 1 | |
| 13849.60156 | 1 | |
| 13776.4248 | 1 | |
| 13468.87793 | 1 | |
| 13252.23828 | 1 | |
| 13188.44531 | 1 | |
| 13139.76367 | 1 | |
| 12893.33398 | 1 |
| Distinct | 13673 |
|---|---|
| Distinct (%) | 51.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1089.072842 |
| Minimum | 0 |
|---|---|
| Maximum | 15504.71875 |
| Zeros | 12786 |
| Zeros (%) | 48.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 65.655935 |
| Q3 | 982.481475 |
| 95-th percentile | 6297.73337 |
| Maximum | 15504.71875 |
| Range | 15504.71875 |
| Interquartile range (IQR) | 982.481475 |
Descriptive statistics
| Standard deviation | 2062.366628 |
|---|---|
| Coefficient of variation (CV) | 1.893690254 |
| Kurtosis | 6.460156517 |
| Mean | 1089.072842 |
| Median Absolute Deviation (MAD) | 65.655935 |
| Skewness | 2.552863733 |
| Sum | 28814689.27 |
| Variance | 4253356.108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 634.9527 | 1 | < 0.1% |
| 923.92059 | 1 | < 0.1% |
| 660.62616 | 1 | < 0.1% |
| 2654.92358 | 1 | < 0.1% |
| 2640.26147 | 1 | < 0.1% |
| 1210.97314 | 1 | < 0.1% |
| 2598.6394 | 1 | < 0.1% |
| 1256.33276 | 1 | < 0.1% |
| 91.70498 | 1 | < 0.1% |
| Other values (13663) | 13663 |
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 0.08405 | 1 | < 0.1% |
| 0.25951 | 1 | < 0.1% |
| 0.49387 | 1 | < 0.1% |
| 0.58611 | 1 | < 0.1% |
| 1.12765 | 1 | < 0.1% |
| 1.35177 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| 1.54492 | 1 | < 0.1% |
| 1.59675 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15504.71875 | 1 | |
| 15398.2627 | 1 | |
| 14757.6543 | 1 | |
| 14688.9248 | 1 | |
| 13878.33203 | 1 | |
| 13858.42871 | 1 | |
| 13776.4248 | 1 | |
| 13468.87793 | 1 | |
| 13252.23828 | 1 | |
| 13139.76367 | 1 |
| Distinct | 14406 |
|---|---|
| Distinct (%) | 54.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1125.372468 |
| Minimum | 0 |
|---|---|
| Maximum | 15504.71875 |
| Zeros | 12053 |
| Zeros (%) | 45.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 206.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 179.797 |
| Q3 | 1031.984952 |
| 95-th percentile | 6382.693894 |
| Maximum | 15504.71875 |
| Range | 15504.71875 |
| Interquartile range (IQR) | 1031.984952 |
Descriptive statistics
| Standard deviation | 2084.433683 |
|---|---|
| Coefficient of variation (CV) | 1.852216705 |
| Kurtosis | 6.243365806 |
| Mean | 1125.372468 |
| Median Absolute Deviation (MAD) | 179.797 |
| Skewness | 2.518042048 |
| Sum | 29775104.75 |
| Variance | 4344863.781 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12053 | |
| 536.99463 | 1 | < 0.1% |
| 643.71686 | 1 | < 0.1% |
| 554.52521 | 1 | < 0.1% |
| 584.31262 | 1 | < 0.1% |
| 947.17914 | 1 | < 0.1% |
| 1818.9978 | 1 | < 0.1% |
| 418.28308 | 1 | < 0.1% |
| 58.28705 | 1 | < 0.1% |
| 4.50438 | 1 | < 0.1% |
| Other values (14396) | 14396 |
| Value | Count | Frequency (%) |
| 0 | 12053 | |
| 0.08405 | 1 | < 0.1% |
| 0.49387 | 1 | < 0.1% |
| 0.58611 | 1 | < 0.1% |
| 1.12765 | 1 | < 0.1% |
| 1.35177 | 1 | < 0.1% |
| 1.45313 | 1 | < 0.1% |
| 1.59675 | 1 | < 0.1% |
| 1.66835 | 1 | < 0.1% |
| 2.0297 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15504.71875 | 1 | |
| 15398.2627 | 1 | |
| 14757.6543 | 1 | |
| 14688.9248 | 1 | |
| 13914.3916 | 1 | |
| 13878.33203 | 1 | |
| 13858.42871 | 1 | |
| 13468.87793 | 1 | |
| 13252.0293 | 1 | |
| 13139.78711 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24111 |
| Missing (%) | 91.1% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 27 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11735 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2320 | 8.8% |
| Water | 27 | 0.1% |
| (Missing) | 24111 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2320 | |
| water | 27 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2347 | |
| a | 2347 | |
| U | 2320 | |
| b | 2320 | |
| n | 2320 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9388 | |
| Uppercase Letter | 2347 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2347 | |
| a | 2347 | |
| b | 2320 | |
| n | 2320 | |
| t | 27 | 0.3% |
| e | 27 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2320 | |
| W | 27 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11735 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2347 | |
| a | 2347 | |
| U | 2320 | |
| b | 2320 | |
| n | 2320 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11735 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2347 | |
| a | 2347 | |
| U | 2320 | |
| b | 2320 | |
| n | 2320 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24081 |
| Missing (%) | 91.0% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 27 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11885 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2350 | 8.9% |
| Water | 27 | 0.1% |
| (Missing) | 24081 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2350 | |
| water | 27 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2377 | |
| a | 2377 | |
| U | 2350 | |
| b | 2350 | |
| n | 2350 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9508 | |
| Uppercase Letter | 2377 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2377 | |
| a | 2377 | |
| b | 2350 | |
| n | 2350 | |
| t | 27 | 0.3% |
| e | 27 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2350 | |
| W | 27 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11885 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2377 | |
| a | 2377 | |
| U | 2350 | |
| b | 2350 | |
| n | 2350 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11885 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2377 | |
| a | 2377 | |
| U | 2350 | |
| b | 2350 | |
| n | 2350 | |
| W | 27 | 0.2% |
| t | 27 | 0.2% |
| e | 27 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24080 |
| Missing (%) | 91.0% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 22 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11890 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2356 | 8.9% |
| Water | 22 | 0.1% |
| (Missing) | 24080 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2356 | |
| water | 22 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2356 | |
| b | 2356 | |
| n | 2356 | |
| W | 22 | 0.2% |
| t | 22 | 0.2% |
| e | 22 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9512 | |
| Uppercase Letter | 2378 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| b | 2356 | |
| n | 2356 | |
| t | 22 | 0.2% |
| e | 22 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2356 | |
| W | 22 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11890 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2356 | |
| b | 2356 | |
| n | 2356 | |
| W | 22 | 0.2% |
| t | 22 | 0.2% |
| e | 22 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11890 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2356 | |
| b | 2356 | |
| n | 2356 | |
| W | 22 | 0.2% |
| t | 22 | 0.2% |
| e | 22 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24080 |
| Missing (%) | 91.0% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 18 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11890 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2360 | 8.9% |
| Water | 18 | 0.1% |
| (Missing) | 24080 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2360 | |
| water | 18 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2360 | |
| b | 2360 | |
| n | 2360 | |
| W | 18 | 0.2% |
| t | 18 | 0.2% |
| e | 18 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9512 | |
| Uppercase Letter | 2378 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| b | 2360 | |
| n | 2360 | |
| t | 18 | 0.2% |
| e | 18 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2360 | |
| W | 18 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11890 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2360 | |
| b | 2360 | |
| n | 2360 | |
| W | 18 | 0.2% |
| t | 18 | 0.2% |
| e | 18 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11890 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2378 | |
| a | 2378 | |
| U | 2360 | |
| b | 2360 | |
| n | 2360 | |
| W | 18 | 0.2% |
| t | 18 | 0.2% |
| e | 18 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24051 |
| Missing (%) | 90.9% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 36 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12035 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2371 | 9.0% |
| Water | 36 | 0.1% |
| (Missing) | 24051 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2371 | |
| water | 36 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2407 | |
| a | 2407 | |
| U | 2371 | |
| b | 2371 | |
| n | 2371 | |
| W | 36 | 0.3% |
| t | 36 | 0.3% |
| e | 36 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9628 | |
| Uppercase Letter | 2407 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2407 | |
| a | 2407 | |
| b | 2371 | |
| n | 2371 | |
| t | 36 | 0.4% |
| e | 36 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2371 | |
| W | 36 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12035 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2407 | |
| a | 2407 | |
| U | 2371 | |
| b | 2371 | |
| n | 2371 | |
| W | 36 | 0.3% |
| t | 36 | 0.3% |
| e | 36 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2407 | |
| a | 2407 | |
| U | 2371 | |
| b | 2371 | |
| n | 2371 | |
| W | 36 | 0.3% |
| t | 36 | 0.3% |
| e | 36 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 23977 |
| Missing (%) | 90.6% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 49 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12405 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2432 | 9.2% |
| Water | 49 | 0.2% |
| (Missing) | 23977 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2432 | |
| water | 49 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2481 | |
| a | 2481 | |
| U | 2432 | |
| b | 2432 | |
| n | 2432 | |
| W | 49 | 0.4% |
| t | 49 | 0.4% |
| e | 49 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9924 | |
| Uppercase Letter | 2481 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2481 | |
| a | 2481 | |
| b | 2432 | |
| n | 2432 | |
| t | 49 | 0.5% |
| e | 49 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2432 | |
| W | 49 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12405 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2481 | |
| a | 2481 | |
| U | 2432 | |
| b | 2432 | |
| n | 2432 | |
| W | 49 | 0.4% |
| t | 49 | 0.4% |
| e | 49 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2481 | |
| a | 2481 | |
| U | 2432 | |
| b | 2432 | |
| n | 2432 | |
| W | 49 | 0.4% |
| t | 49 | 0.4% |
| e | 49 | 0.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 23938 |
| Missing (%) | 90.5% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 35 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12600 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2485 | 9.4% |
| Water | 35 | 0.1% |
| (Missing) | 23938 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2485 | |
| water | 35 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2520 | |
| a | 2520 | |
| U | 2485 | |
| b | 2485 | |
| n | 2485 | |
| W | 35 | 0.3% |
| t | 35 | 0.3% |
| e | 35 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10080 | |
| Uppercase Letter | 2520 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2520 | |
| a | 2520 | |
| b | 2485 | |
| n | 2485 | |
| t | 35 | 0.3% |
| e | 35 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2485 | |
| W | 35 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12600 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2520 | |
| a | 2520 | |
| U | 2485 | |
| b | 2485 | |
| n | 2485 | |
| W | 35 | 0.3% |
| t | 35 | 0.3% |
| e | 35 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12600 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2520 | |
| a | 2520 | |
| U | 2485 | |
| b | 2485 | |
| n | 2485 | |
| W | 35 | 0.3% |
| t | 35 | 0.3% |
| e | 35 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 23893 |
| Missing (%) | 90.3% |
| Memory size | 206.8 KiB |
| Urban | |
|---|---|
| Water | 15 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12825 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2550 | 9.6% |
| Water | 15 | 0.1% |
| (Missing) | 23893 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2550 | |
| water | 15 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2565 | |
| a | 2565 | |
| U | 2550 | |
| b | 2550 | |
| n | 2550 | |
| W | 15 | 0.1% |
| t | 15 | 0.1% |
| e | 15 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10260 | |
| Uppercase Letter | 2565 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2565 | |
| a | 2565 | |
| b | 2550 | |
| n | 2550 | |
| t | 15 | 0.1% |
| e | 15 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2550 | |
| W | 15 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12825 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2565 | |
| a | 2565 | |
| U | 2550 | |
| b | 2550 | |
| n | 2550 | |
| W | 15 | 0.1% |
| t | 15 | 0.1% |
| e | 15 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12825 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2565 | |
| a | 2565 | |
| U | 2550 | |
| b | 2550 | |
| n | 2550 | |
| W | 15 | 0.1% |
| t | 15 | 0.1% |
| e | 15 | 0.1% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| LAT | LON | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | LABEL2013 | LABEL2014 | LABEL2015 | LABEL2016 | LABEL2017 | LABEL2018 | LABEL2019 | LABEL2020 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 17.3225 | 78.0075 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 1 | 17.3275 | 78.0075 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 2 | 17.3325 | 78.0075 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 3 | 17.3225 | 78.0125 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 4 | 17.3275 | 78.0125 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 5 | 17.3325 | 78.0125 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 6 | 17.3375 | 78.0125 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 7 | 17.3125 | 78.0175 | 0.0 | 636.69885 | 636.69885 | 636.69885 | 636.69885 | 636.69885 | 636.69885 | 636.69885 | None | None | None | None | None | None | None | None |
| 8 | 17.3175 | 78.0175 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 9 | 17.3225 | 78.0175 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
Last rows
| LAT | LON | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | LABEL2013 | LABEL2014 | LABEL2015 | LABEL2016 | LABEL2017 | LABEL2018 | LABEL2019 | LABEL2020 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 26448 | 17.5175 | 79.02750 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 224.03043 | None | None | None | None | None | None | None | None |
| 26449 | 17.4925 | 79.03250 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26450 | 17.4975 | 79.03250 | 0.0 | 544.25580 | 544.25580 | 544.43494 | 544.43494 | 544.43494 | 544.43494 | 544.43494 | None | None | None | None | None | None | None | None |
| 26451 | 17.5025 | 79.03250 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26452 | 17.5075 | 79.03250 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26453 | 17.5125 | 79.03250 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26454 | 17.4975 | 79.03751 | 0.0 | 545.16174 | 545.16174 | 544.43805 | 544.43805 | 544.43805 | 544.43805 | 544.43805 | None | None | None | None | None | None | None | None |
| 26455 | 17.5025 | 79.03751 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26456 | 17.5075 | 79.03751 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |
| 26457 | 17.5075 | 79.04250 | 0.0 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | None | None | None | None | None | None | None | None |